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Identification of anew human coronavirus 
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Three human coronaviruses are known to exist: human coronavirus 229E (HCoV-229E), HCoV-OC43 and severe acute respiratory 
syndrome (SARS)-associated coronavirus (SARS-CoV). Here we report the identification of a fourth human coronavirus, HCoV- 
NL63, using a new method of virus discovery. The virus was isolated from a 7-month-old child suffering from bronchiolitis and 
conjunctivitis. The complete genome sequence indicates that this virus is not a recombinant, but rather a new group 1 
coronavirus. The in vitro host cell range of HCoV-NL63 is notable because it replicates on tertiary monkey kidney cells and the 
monkey kidney LLC-MK2 cell line. The viral genome contains distinctive features, including a unique N-terminal fragment within 
the spike protein. Screening of clinical specimens from individuals suffering from respiratory illness identified seven additional 
HCoV-NL63-infected individuals, indicating that the virus was widely spread within the human population. 


To date, there is still a variety of human diseases with unknown eti- 
ology. A viral origin has been suggested for many of these diseases, 
emphasizing the importance of a continuous search for new 
viruses!-3, Major difficulties are encountered, however, when 
searching for new viruses. First, some viruses do not replicate 
in vitro, at least not in the cells that are commonly used in viral diag- 
nostics. Second, for those viruses that do replicate in vitro and cause 
a cytopathic effect (CPE), the subsequent virus identification meth- 
ods may fail. Antibodies raised against known viruses may not rec- 
ognize the cultured virus, and virus-specific PCR methods may not 
amplify the new viral genome. To solve both problems, we devel- 
oped a new method for virus discovery based on the cDNA- 
amplified restriction fragment-length polymorphism technique 
(cDNA-AFLP*). Here we report the identification of a new 
coronavirus using this method of Virus-Discovery-cDNA- 
AFLP (VIDISCA). 

Coronaviruses, a genus of the Coronaviridae family, are enveloped 
viruses with a large plus-strand RNA genome. The genomic RNA is 
27-32 kb in size, capped and polyadenylated. Three serologically dis- 
tinct groups of coronaviruses have been described. Within each 
group, viruses are characterized by their host range and genome 
sequence. Coronaviruses have been identified in mice, rats, chickens, 
turkeys, swine, dogs, cats, rabbits, horses, cattle and humans, and can 
cause a variety of severe diseases including gastroenteritis and respi- 
ratory tract diseases**°. Three human coronaviruses have been stud- 
ied in detail. HCoV-229E and HCoV-OC43 were identified in the 
mid-1960s, and are known to cause the common cold’~!>. The 
recently identified SARS-CoV causes a life-threatening pneumonia, 
and is the most pathogenic human coronavirus identified thus 
far!©!8, SARS-CoV is likely to reside in an animal reservoir, and has 
recently initiated the epidemic in humans through zoonotic 


transmission!?°, It has been suggested that SARS-CoV is the first 
member of a fourth group of coronaviruses, or that it is an outlier of 
group 2 (refs. 21, 22). 

The new coronavirus that we present here was isolated from a child 
suffering from bronchiolitis and conjunctivitis. This was not an 
isolated case, as we identified the virus in clinical specimens from 
seven additional individuals, both infants and adults, during the last 
winter season. We also resolved the complete sequence of the viral 
genome, which revealed several unique features. 


RESULTS 

Virus isolation from a child with acute respiratory disease 

In January 2003, a 7-month-old child was admitted to the hospital 
with coryza, conjunctivitis and fever. Chest radiography revealed typi- 
cal features of bronchiolitis. A nasopharyngeal aspirate specimen was 
collected 5 d after the onset of disease (sample NL63). Diagnostic tests 
for respiratory syncytial virus, adenovirus, influenza viruses A and B, 
parainfluenza virus types 1, 2 and 3, rhinovirus, enterovirus, 
HCoV-229E and HCoV-OC43 yielded negative results. The clinical 
sample was subsequently inoculated onto human fetal lung fibro- 
blasts, tertiary monkey kidney cells (Cynomolgus monkey) and HeLa 
cells. CPE was detected exclusively on tertiary monkey kidney cells, 
and was first noted 8 d after inoculation. The CPE was diffuse, with a 
refractive appearance in the affected cells followed by cell detachment. 
More pronounced CPE was observed upon passage onto the monkey 
kidney cell line LLC-MK2, with overall cell rounding and moderate 
cell enlargement (Supplementary Fig. 1 online). Additional subcul- 
tures on human fetal lung fibroblasts, rhabdomyosarcoma cells and 
Vero cells remained negative for CPE. Immunofluorescence assays to 
detect respiratory syncytial virus, adenovirus, influenza viruses A and 
B, and parainfluenza virus types 1, 2 and 3 remained negative. Acid 
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lability and chloroform sensitivity tests indicated that the virus was 
most likely enveloped, and did not belong to the picornavirus group”. 


Virus discovery by the VIDISCA method 
Identification of unknown pathogens using molecular biology tools is 
difficult because the target sequence is not known, so genome-specific 
PCR primers cannot be designed. To overcome this problem, we 
developed the VIDISCA method based on the cDNA-AFLP tech- 
nique*. The advantage of VIDISCA is that prior knowledge of the 
sequence is not required, as the presence of restriction enzyme sites is 
sufficient to guarantee PCR amplification. The input sample can be 
either blood plasma or serum, or culture supernatant. Whereas 
cDNA-AFLP starts with isolated mRNA, VIDISCA begins with a 
treatment to selectively enrich for viral nucleic acid, including a cen- 
trifugation step to remove residual cells and mitochondria (Fig. 1a). A 
DNase treatment is also used to remove interfering chromosomal and 
mitochondrial DNA from degraded cells (viral nucleic acid is pro- 
tected within the viral particle). Finally, by choosing frequently cut- 
ting restriction enzymes, the method can be fine-tuned such that 
most viruses will be amplified. We were able to amplify viral nucleic 
acids in EDTA-treated plasma from a person with hepatitis B viral 
infection, and from a person with an acute parvovirus B19 infection 
(Fig. 1b). The technique can also detect HIV-1 in cell culture, demon- 
strating its capacity to identify both RNA and DNA viruses (Fig. 1b). 
The supernatant of the CPE-positive LLC-MK2 culture NL63 was 
analyzed by VIDISCA. The supernatant of uninfected cells was used 
as a negative control. After the second PCR amplification step, unique 
and prominent DNA fragments were present in the test sample but 
not in the control (1 of 16 selective PCR reactions is shown in Fig. 1c). 
These fragments were cloned and sequenced. Thirteen of 16 frag- 
ments showed sequence similarity to members of the coronavirus 
family, but significant sequence divergence with known coronaviruses 
was apparent in all fragments, indicating that we had identified a new 
coronavirus. The sequences of the 13 VIDISCA fragments are pro- 
vided in Supplementary Figure 2 online. 
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Figure 1 The VIDISCA method. (a) Schematic overview of steps in VIDISCA 
method. (b) Examples of VIDISCA-mediated virus identification. Specimens 
were analyzed using ethidium bromide-stained agarose (parvovirus B19) or 
Metaphor agarose (HBV and HIV-1) gel electrophoresis. Lane M, DNA 
molecular weight markers; —, negative controls; +, VIDISCA PCR products 
for HBV (amplified with primers HinP11-T/Msel-T), parvovirus B19 (HinP11 
standard primer only) or HIV-1 (EcoRI-A/Msel-C primers). (c) VIDISCA PCR 
products for HCoV-NL63. HinP1I-G and Msel-A primers were used for 
selective amplification; products were visualized by Metaphor agarose gel 
electrophoresis. Lanes 1 and 2, duplicate PCR product of cultured HCoV- 
NL63 harvested from LLC-Mk2 cells; 3 and 4, duplicate control 
supernatant from uninfected LLC-MK2 cells; 5 and 6, duplicate negative 
controls containing water; M, 25-bp molecular weight marker. Arrow 
indicates HCoV-NL63 fragment that was excised from gel and sequenced. 


Detection of HCoV-NL63 in patient specimens 

To show that HCoV-NL63 originated from the nasopharyngeal 
aspirate of the child, we designed a diagnostic RT-PCR that specifi- 
cally detects HCoV-NL63. This test confirmed the presence of 
HCoV-NL63 in the clinical sample. The sequence of the RT-PCR 
product of the 1b gene was identical to that of the virus identified 
upon in vitro passage in LLC-MK2 cells (data not shown). 

Having confirmed that the cultured coronavirus originated from 
the child, the question remained as to whether this was an isolated 
clinical case, or whether HCoV-NL63 is circulating in humans. To 
address this question, we used two diagnostic RT-PCR assays to exam- 
ine respiratory specimens of hospitalized individuals and those 
visiting the outpatient clinic between December 2002 and August 
2003 (Fig. 2). We identified seven additional individuals carrying 


° 
3 


Patients tested per month 
PS 
oO 


Dec Jan Feb Mar Apr May Jun Jul Aug 
02 #03 #403 «=03)0~«603:):«(O03:S03 038s 


HCoV-NL63-positive 
patients (%) 
OANWAATMNOO 


Dec Jan Feb Mar Apr May Jun Jul Aug 
02 #03 «#403 «#203)«€©03)=«(03-03-ss03-s«éOOB 


Figure 2 Detection of HCoV-NL63 in winter months of 2002 and 2003. 
(a) Number of patients tested per month. (b) Percentage of patients 
positive for HCoV-NL63. 
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HCoV-NL63 (Table 1). Sequence analysis of the PCR products indi- 
cated the presence of a few characteristic point mutations in several 
samples, suggesting that several viruses with different molecular 
markers may be cocirculating (Fig. 3 and Supplementary Fig. 3 
online). At least five of the HCoV-NL63-positive individuals suffered 
from respiratory tract illness; the clinical data of two individuals was 
not available. Including the index case, five of the patients were chil- 
dren less than 1 year old, and three patients were adults. Two adults 
were likely to be immunosuppressed, as one of them was a bone mar- 
row transplant recipient and the other an HIV-positive patient suffer- 
ing from AIDS, with very low CD4* cell counts (Table 1). No clinical 
data was available for the third adult. One patient was coinfected with 
respiratory syncytial virus (no. 72), and the HIV-infected patient 
(no. 466) carried Pneumocystis carinii. No other respiratory agent was 
found in the other patients, suggesting that the respiratory symptoms 
were caused by HCoV-NL63. All positive samples were collected dur- 
ing the last winter season, with a detection frequency of 7% in January 
2003. None of the 306 samples collected in the spring and summer of 
2003 contained HCoV-NL63 (P < 0.01 by two-tailed ¢ test). 


Complete genome analysis of HCoV-NL63 

The genomes of coronaviruses have a characteristic organization. The 
5’ two-thirds contain the la and 1b genes that encode the nonstruc- 
tural polyproteins, followed by the genes encoding four structural 
proteins: spike (S), envelope (E), membrane (M) and nucleocapsid 
(N). The genomes of known coronaviruses contain a variable number 
of unique characteristic open reading frames (ORFs) encoding non- 
structural proteins either between the 1b and S genes, between the 
S and E genes, between the M and N genes, or downstream of the 
N gene. 

To determine whether the HCoV-NL63 genome organization 
shares these characteristics, we constructed a cDNA library with puri- 
fied virus stock as input material. A total of 475 genome fragments 
were analyzed, with an average coverage of seven sequences per 
nucleotide. Specific PCR reactions were designed to fill in gaps and to 
sequence regions with low-quality sequence data. We combined this 
with 5’ and 3’ rapid amplification of cDNA ends to resolve the com- 
plete HCoV-NL63 genome sequence. 

The RNA genome of HCoV-NL63 consists of 27,553 nucleotides 
and a poly-A tail. With a GC content of 34%, HCoV-NL63 has the 
lowest GC content among the Coronaviridae, which range from 
37-42% (ref. 24). ZCurve software was used to identify the ORFs”°, 
and the genome configuration was portrayed using the similarity 
with known coronaviruses as a guide (Fig. 4a and Supplementary 
Table 1 online). Short untranslated regions (UTRs) of 286 and 287 
nucleotides are present at the 5’ and 3’ termini, respectively. The la 
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Figure 3 Phylogenetic analysis of RT-PCR sequences of the la gene from 
HCoV-NL63-positive patients. HCoV-229E was used to root the tree. 


Table 1 Patients positive for HCoV-NL63 


Patient no. Age Symptoms Specimen Sample date 
72 4 months URTI NPA 31 Dec 2002 
251 67 years Unknown OPA 7 Jan 2003 
223 52 years Unknown? OPA 8 Jan 2003 
NL634 7 months LRTI NPA 10 Jan 2003 
246 9 months LRTI OPA 13 Jan 2003 
248 11 months URTI OPA 16 Jan 2003 
466 AO years LRTI BAL 4 Feb 2003 
496 7 months URTI OPA 25 Feb 2003 


@Seven-month-old patient from whom HCoV-NL63 was cultured. Patient received bone mar- 
row transplant in 2001. Patient infected with HIV-1; 20 CD4* cells per mm3. URTI, upper 
respiratory tract illness; LRTI, lower respiratory tract illness; NPA, nasopharyngeal aspirate; 
OPA, oropharyngeal aspirate; BAL, bronchoalveolar lavage. 


and 1b genes encode the RNA polymerase and proteases that are 
essential for virus replication. A potential pseudoknot structure is 
present at position 12,439 (data not shown), which may provide the 
—1 frameshift signal to translate the 1b polyprotein. Genes predicted 
to encode the S, E, M and N proteins are found in the 3’ part of the 
genome. The hemagglutinin-esterase gene, which is present in some 
group 2 coronaviruses, is not present in HCoV-NL63. ORF3, located 
between the S and E genes, probably encodes a single accessory non- 
structural protein; this gene showed only limited similarity to ORF4A 
and ORF4B of HCoV-229E and ORF3 of porcine epidemic diarrhea 
virus (PEDV). 

The la and lab polyproteins are translated from the genomic RNA, 
but the remaining viral proteins are translated from subgenomic 
mRNAs made by discontinuous transcription during negative strand 
synthesis*®, Each subgenomic mRNA has a common 39’ end, derived 
from the 5’ portion of the genome (the 5’ leader sequence), and com- 
mon 3’ coterminal parts. Discontinuous transcription requires base- 
pairing between cis-acting transcription regulatory sequences (TRSs), 
one located near the 5’ part of the viral genome (the leader TRS) and 
others located upstream of each of the respective ORFs (the body 
TRSs)?”. The cDNA bank that we sequenced contained copies of the 
subgenomic mRNA for the N protein, thus providing the opportunity 
to exactly map the leader sequence that is fused to all subgenomic 
mRNAs. A leader of 72 nucleotides was identified at the 5’ UTR. 
Eleven of twelve nucleotides of the leader TRS (5’-UCU- 
CAACUAAAC-3’) showed similarity with the body TRS upstream of 
the N gene. Putative TRSs were also identified upstream of the S, 
ORF3, E and M genes (Supplementary Table 2 online). 

We next aligned the sequence of HCoV-NL63 with the complete 
genomes of other coronaviruses. The percentage nucleotide identity 
was determined for each gene and is listed in Table 2. All genes except 
the M gene shared the highest identity with HCoV-229E. To confirm 
that HCoV-NL63 is a new member of the group 1 coronaviruses, we 
conducted phylogenetic analysis using the nucleotide sequence of 
the la, 1b, S, M and N genes (Fig. 4b). For each gene analyzed, 
HCoV-NL63 clustered with the group 1 coronaviruses. The la, 1b 
and S genes of HCoV-NL63 are most closely related to those of 
HCoV-229E. However, further inspection revealed a subcluster of 
HCoV-NL63, HCoV-229E and PEDV. Phylogenetic analysis could 
not be performed for the ORF3 and E genes because the regions were 
too variable or too small for analysis, respectively. Bootscan analysis 
by the Simplot software version 2.5 (ref. 28) found no signs of 
recombination (data not shown). 

The presence of a single nonstructural gene between the S and E 
genes is noteworthy because almost all coronaviruses have two 
or more ORFs in this region, with the exception of PEDV and 
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Figure 4 HCoV-NL63 genome organization and phylogenetic analysis. 
(a) ORFs encoding 1a, 1b, S, ORF3, E, M and N proteins are flanked by 
286-nucleotide 5’ UTR and 287-nucleotide 3’ UTR. Coordinates of each 
ORF are provided in Supplementary Table 1 online. (b) Phylogenetic 
analysis of HCoV-NL63, using nucleotide sequences predicted to encode 
la, 1b, S, M and N proteins (see Supplementary Methods online for 
GenBank accession numbers). Red, group 1 viruses; blue, group 2; 
green, group 3; purple, SARS-CoV. MHV, mouse hepatitis virus; IBV, avian 
infectious bronchitis virus; BCoV, bovine coronavirus; FCoV, feline enteric 
coronavirus; CCoV, canine coronavirus; FIPV, feline infectious peritonitis 
virus; EqCoV, equine coronavirus; TCoV, turkey coronavirus. 


HCoV-OC43 (ref. 29,30). Perhaps most notable is a large insert of 537 
nucleotides in the 5’ portion of the S gene of HcoV-NL63, as com- 
pared with that of HCoV-229E. A BLAST search found no similarity 
between this additional 179—amino acid domain of the S protein and 
any coronavirus or other sequence deposited in GenBank. An align- 
ment of the HCoV-NL63 S protein sequence with those of other 
group 1 coronaviruses is shown in Supplementary Figure 4 online. 


DISCUSSION 
In this study we present a detailed description of a new human coron- 
avirus. Thus far, only three human coronaviruses have been charac- 
terized if we include SARS-CoV; further characterization of 
HCoV-NL63 as the fourth member will provide important insight 
into the variation among human coronaviruses. HCoV-NL63 is a 
member of the group 1 coronaviruses and is most closely related to 
HCoV-229E, but the differences between them are prominent. First, 
they share on average only 65% sequence identity. Second, a single 
gene, ORF3, in HCoV-NL63 takes the place of the 4A and 4B genes of 
HCoV-229E. Third, the 5’ region of the S gene of HCoV-NL63 con- 
tains a large in-frame insertion of 537 nucleotides. The N-terminal 
region of the S protein has been implicated in binding to aminopepti- 
dase N (group I coronaviruses) and sialic acid?!~*°, so the 179-amino 
acid insert in HCoV-NL63 might be involved in receptor binding and 
may explain the tropism of this virus in cell culture. However, the 
aminopeptidase N receptor-binding domain of the HCoV-229E S 
protein has been mapped to amino acids 407-547 (ref. 33), so it seems 
unlikely that the insertion will be directly involved in binding to 
aminopeptidase N. Fourth, whereas HCoV-229E is fastidious in cell 
culture with a narrow host range, HCoV-NL63 replicates efficiently in 
monkey kidney cells. SARS-CoV is also able to replicate in monkey 
kidney cells (Vero-E6 cells*#), yet the predicted S proteins of SARS- 
CoV and HCoV-NL63 do not share a domain that could explain the 
in vitro host cell range of these viruses. Other viral proteins may influ- 
ence the cell tropism of a virus, but none of the HCoV-NL63 proteins 
were more closely related to SARS-CoV than to HCoV-229E. 
Variability at the 5’ end of the S gene, correlating with alterations in 
tropism, has also been described for the group 1 coronaviruses 
porcine respiratory coronavirus (PRCoV) and transmissible gas- 
troenteritis virus (TGEV). These porcine viruses are antigenically and 
genetically related, but their pathogenicity is markedly different. 
TGEV replicates in and destroys the enterocytes of the small intestine, 
causing severe diarrhea with high mortality in neonatal swine. In con- 
trast, PRCoV (which emerged more recently than TGEV) has a selec- 
tive tropism for respiratory tissue, and very little capacity to replicate 
in intestinal tissue. The difference between the TGEV and PRCoV S 
gene sequences is comparable to the difference between those of 
HCoV-NL63 and HCoV-229E*°. Compared with TGEV, PRCoV con- 
tains a deletion in the 5’ hypervariable region of the S gene. The extra 
region that is present at the 5’ end of the TGEV S gene is responsible 
for the hemagglutination activity of TGEV, and its capacity to bind to 


NATURE MEDICINE VOLUME 10 | NUMBER 4 | APRIL 2004 


371 


& © 2004 Nature Publishing Group http://www.nature.com/naturemedicine 


ARTICLES 


Table 2 Percent nucleotide sequence identity between HCoV-NL63 
coronaviruses 


Group 1 Group 2 
Gene HCoV-229E  PEDV TGEV BCoV MHV 
la 63 57 50 40 38 
1b 75 72 69 58 57 
Ss 56 53 48 35 34 
ORF3 53 49 29 29 25 
E 59 54 49 37 39 
M 64 65 46 47 43 
N 52 38 45 30 29 


and other cDNA-AFLP was performed essentially as 

described‘, with some modifications. The double- 

stranded DNA was digested with the HinP1I and 
Group 3 Group 2/4 Mel restriction enzymes (New England Biolabs). 
IBV SARS-CoV Msel and HinP1I anchors (see below) were subse- 
37 38 quently added, along with 5U ligase enzyme 
59 58 (Invitrogen) in the supplied ligase buffer, for 2 h at 
32 32 37 °C. The Msel and HinP 1] anchors were prepared 
19 23 by mixing a top-strand oligonucleotide (5’-CTCG- 
25 40 TAGACTGCGTACC-3’ for the Msel anchor and 
43 42 5’-GACGATGAGTCCTGAC-3’ for the HinP1I 
31 30 anchor) with a bottom-strand oligonucleotide 


GenBank accession numbers for viruses are provided in Supplementary Methods online. 


sialic acid**. However, this region shows no similarity to the 
HCoV-NL63 insert. 

The common cold—causing virus HCoV-229E can cause more seri- 
ous respiratory disease in infants and immunocompromised 
patients*®”, Our data indicate that HCoV-NL63 causes acute respi- 
ratory disease in children below the age of 1 year, and in immuno- 
compromised adults. To date, no known viral pathogen can be 
identified in a substantial portion of respiratory disease cases in 
humans (20-30%; ref. 38). Several assays have been used to diagnose 
coronavirus infections. Traditionally, an antibody test is imple- 
mented to measure a rise in titers of antibodies to the human coron- 
aviruses HCoV-229E or HCoV-OC43 (ref. 12). Antibodies to 
HCoV-NL63 might cross-react with HCoV-229E, given that these 
viruses are members of the same serotype. If this were the case, 
HCoV-NL63_ infections might have been misdiagnosed as 
HCoV-229E. Molecular biology tools such as RT-PCR assays??? 
were designed to selectively detect the human coronaviruses 
HCoV-229E and HCoV-OC43, but these assays will not detect 
HCoV-NL63. Even the RT-PCR assay that was designed to amplify all 
known coronaviruses*? is not able to amplify HCoV-NL63 because 
of several mismatches with the primer sequences. The availability of 
the complete HCoV-NL63 genome sequence means that these 
diagnostic assays can be substantially improved. 

Our results indicate that HCoV-NL63 is present in a significant 
number of respiratory tract illnesses of unknown etiology. 
HCoV-NL63 was detected in patients suffering from respiratory 
disease, with a frequency of up to 7% in January 2003. The virus was 
not detected in more recent samples collected in the spring and 
summer of 2003, which correlates with the fact that human coron- 
aviruses tend to be transmitted predominantly in the winter season’. 
Future experiments with more sensitive diagnostic tools should yield 
a more accurate picture of the prevalence of this virus and its 
association with respiratory disease. 


METHODS 

VIDISCA method. The virus was cultured on LLC-MK2 cells. Details of virus 
culture and patient descriptions are available in Supplementary Methods 
online. To remove residual cells and mitochondria, 110 wl of virus culture 
supernatant was spun for 10 min at maximum speed (13,500 r.p.m.) in an 
Eppendorf microcentrifuge. To remove chromosomal DNA and mitochondr- 
ial DNA from the lysed cells, 100 ul of supernatant was transferred to a fresh 
tube and treated with DNase I for 45 min at 37 °C (Ambion). Nucleic acids 
were extracted as described*!. A reverse transcription reaction was performed 
with random hexamer primers (Amersham Bioscience) and Moloney murine 
leukemia virus reverse transcriptase (MMLV-RT; Invitrogen). Second-strand 
DNA synthesis was carried out with Sequenase II (Amersham Bioscience), 
without further addition of a primer. A phenol-chloroform extraction was 
followed by ethanol precipitation. 


(5’-TAGGTACGCAGTC-3’ for the Msel anchor 
and 5’- CGGTCAGGACTCAT-3’ for the HinP1I 
anchor) in a 1:40 dilution of ligase buffer. Twenty 
cycles of PCR were carried out with 10 ul of the ligation mixture, 100 ng of 
HinP 11 standard primer (5’°-GACGATGAGTCCTGACCGC-3’) and 100 ng of 
Msel standard primer (5’-CTCGTAGACTGCGTACCTAA-3’). Five microliters 
of this PCR product was used as input in the second ‘selective’ amplification 
step, along with 100 ng HinP 11 N-primer and 100 ng Msel N-primer (the ‘N’ 
indicates that the standard primers were extended with one nucleotide; G, A, T 
or C). The selective rounds of amplification were done using ‘touchdown 
PCR’: 10 cycles of 94 °C for 60 s, 65 °C for 30 s, and 72 °C for 1 min (annealing 
temperature reduced by 1 °C per cycle); 23 cycles of 94 °C for 30 s, 56 °C for 
30s, and 72 °C for 1 min; and finally 1 cycle of 72 °C for 10 min. Sixteen PCR 
reactions, each with 1 of the 16 primer combinations, were conducted for each 
sample in this selective PCR. The PCR products were analyzed on 4% 
Metaphor agarose gels (Cambrex), and the fragments of interest were cloned 
and sequenced using BigDye terminator reagents. Electrophoresis and data 
collection were performed using an ABI 377 instrument. DNA molecular 
weight markers were from Invitrogen and Eurogentec. 

To detect HIV-1, we used VIDISCA with EcoRI digestion instead of HinP11 
digestion. VIDISCA was modified for parvovirus B19 detection as follows: the 
reverse transcription step was excluded; only HinP1I digestion and adaptor 
ligation were performed; the first PCR reaction was performed with 35 cycles 
instead of 20; and the first PCR fragments were visualized by agarose gel elec- 
trophoresis. Details of cDNA library construction and full genome sequencing 
are available in Supplementary Methods online. 


Diagnostic RT-PCR. A total of 614 respiratory samples were collected from 
493 individuals between December 2002 and August 2003 at the Academic 
Medical Center in Amsterdam. The specimens included oral and nasopharyn- 
geal aspirates, throat swabs, bronchoalveolar lavage and sputum. The samples 
had been collected for routine viral diagnostic screening of people suffering 
from upper and/or lower respiratory tract diseases, and the patients consented 
that their samples be used for testing of respiratory viruses that included coro- 
naviruses. We used 100 tl of each sample in a Boom extraction*!. The diagnos- 
tic assay was designed based on the sequence of the 1b gene. The reverse 
transcription was performed with MMLV-RT (Invitrogen), using 10 ng of 
reverse transcription primer (repSZ-RT, 5’-CCACTATAAC-3’; coordinate 
16232 in HCoV-NL63). The entire reverse transcription mixture was added to 
the first PCR mixture containing 100 ng of primer repSZ-1 (5’-GTGATG- 
CATATGCTAATTTG-3%; coordinate 15973) and 100 ng of primer repSZ-3 
(5’-CTCTTGCAGGTATAATCCTA-3’; coordinate 16210). The PCR reaction 
consisted of the following steps: 95 °C for 5 min; then 35 cycles of 95 °C for 
1 min, 55 °C for 1 min, and 72 °C for 2 min; then 72 °C for 10 min. 

A nested PCR was started using 5 ul of the first PCR product with 100 ng of 
primer repSZ-2 (5’-TTGGTAAACAAAAGATAACT-3‘; coordinate 16012) and 
100 ng of primer repSZ-4 (5’-TCAATGCTATAAACAGTCAT-3’; coordinate 
16181). Twenty-five PCR cycles were performed using the same profile as the 
first PCR. Ten microliters of each PCR product was analyzed by agarose gel 
electrophoresis. All positive samples were repeated and sequenced to confirm 
the presence of HCoV-NL63. To verify negative and positive PCR results, an 
additional diagnostic RT-PCR assay was conducted using the la gene primers 
5’-AATATGTCTAACAAATAAAACGATT-3’ (reverse transcriptase primer 
P4H10-3; coordinate 6667), 5’-CTTTTGATAACGGTCACTATG-3’ (SS 5852- 
5P; 5777) 5’-CTCAT TACATAAAACATCAAACGG-3’ 


coordinate and 
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(P4G1M-5-3P; coordinate 6616) in the first PCR; and 5’-GGTCACTATG- 


TAGTTTATGATG-3’ 


(P3E2-5P; coordinate 5788) and 5. 


GGATTTTTCATAACCACTTAC-3’ (SS 6375-3P; coordinate 6313) in the 
nested PCR. Details of sequence analysis are available in Supplementary 
Methods online. 


GenBank accession numbers. The HCoV-NL63 sequences were deposited in 
GenBank under accession numbers AY567487—AY567494. 


Note: Supplementary information is available on the Nature Medicine website. 
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